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Abstract 

We prove that the graph tautology formulas of Alekhnovich, Jo- 
hannsen, Pitassi, and Urquhart have polynomial size pool resolution 
refutations that use only input lemmas as learned clauses and with- 
out degenerate resolution inferences. We also prove that these graph 
tautology formulas can be refuted by polynomial size DPLL proofs 
with clause learning, even when restricted to greedy, unit-propagating 
DPLL search. We prove similar results for the guarded, xor-fied peb- 
bling tautologies which Urquhart proved are hard for regular resolu- 
tion. 
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1 Introduction 



The problem SAT of deciding the satisfiability of propositional CNF for- 
mulas is of great theoretical and practical interest. Even though it is NP- 
complete, industrial instances with hundreds of thousands variables are rou- 
tinely solved by state of the art SAT solvers. Most of these solvers are based 
on the DPLL procedure [13] extended with clause learning, restarts, variable 
selection heuristics, and other techniques. 

The basic DPLL procedure without clause learning is equivalent to tree- 
like resolution. The addition of clause learning makes DPLL considerably 
stronger. In fact, clause learning together with unlimited restarts is capable 
of simulating general resolution proofs |19| . However, the exact power of 
DPLL with clause learning but without restarts is unknown. This question is 
interesting not only for theoretical reasons, but also because of the potential 
for better understanding the practical performance of various refinements of 
DPLL with clause learning. 

Beame, Kautz, and Sabharwal [3] gave the first theoretical analysis of 
DPLL with clause learning. Among other things, they noted that clause 
learning with restarts simulates general resolution. Their construction re- 
quired the DPLL algorithm to ignore some contradictions, but this situ- 
ation was rectified by Pipatsrisawat and Darwiche [19] who showed that 
SAT solvers which do not ignore contradictions can also simulate resolution. 
These techniques were also applied to learning bounded width clauses by 
Atserias et al. [2]. 

Beame et al. [3] also studied DPLL clause learning without restarts. Us- 
ing a method of "proof trace extensions" , they were able to show that DPLL 
with clause learning and no restarts is strictly stronger than any "natural" 
proof system strictly weaker than resolution. Here, a natural proof system 
is one in which proofs do not increase in length substantially when variables 
are restricted to constants. The class of natural proof systems is known to 
include common proof systems such as tree-like or regular proofs. The proof 
trace method involves introducing extraneous variables and clauses, which 
have the effect of giving the clause learning DPLL algorithm more freedom 
in choosing decision variables for branching. 

There have been two approaches to formalizing DPLL with clause learn- 
ing as a static proof system rather than as a proof search algorithm. The 
first is pool resolution with a degenerate resolution inference, due originally 
to Van Gelder [23] and studied further by Hertel et al. [15] . Pool resolution 
requires proofs to have a depth-first regular traversal similarly to the search 
space of a DPLL algorithm. Degenerate resolution allows resolution infer- 
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ences in which one or both of the hypotheses may be lacking occurrences of 
the resolution literal. Van Gelder argued that pool resolution with degen- 
erate resolution inferences simulates a wide range of DPLL algorithms with 
clause learning. He also gave a proof, based on [T], that pool resolution with 
degenerate inferences is stronger than regular resolution, using extraneous 
variables similar to proof trace extensions. 

The second approach is due to Buss et al. p2] who introduced a "partially 
degenerate" resolution rule called w-resolution, and a proof system regWRTI 
based on w-resolution and clause learning of "input lemmas" . They proved 
that regWRTI exactly captures non-greedy DPLL with clause learning. By 
"non-greedy" is meant that contradictions may need to be ignored by the 
DPLL search. 

Both |15] and [12] gave improved versions of the proof trace extension 
method so that the extraneous variables depend only on the set of clauses 
being refuted and not on resolution refutation of the clauses. The drawback 
remains, however, that the proof trace extension method gives contrived sets 
of clauses and contrived resolution refutations. 

It remains open whether any of DPLL with clause learning, pool resolu- 
tion (with or without degenerate inferences), or the regWRTI proof system 
can polynomially simulate general resolution. One approach to answering 
these questions is to try to separate pool resolution (say) from general res- 
olution. So far, however, separation results are known only for the weaker 
system of regular resolution, based on work of Alekhnovich et al. [I], who 
gave an exponential separation between regular resolution and general res- 
olution. Alekhnovich et al. [1] proved this separation for two families of 
tautologies, variants of the graph tautologies GT' and the "Stone" pebbling 
tautologies. Urquhart [22] subsequently gave a related separation using a 
different set of pebbling tautologies which he denoted denoted IL0 In the 
present paper, we call the tautologies GT' the guarded graph tautologies, 
and henceforth denote them GGT instead of GT'; their definition is given 
in Section EJ We define the formulas GPeb fc ®(G) in Section [5] these are 
essentially the IT tautologies of Urquhart. 

An obvious question is whether pool resolution (say) has polynomial size 
proofs of the GGT tautologies, the GPeb fc ®, or the Stone tautologies. The 
present paper resolves the first two questions by showing that pool resolution 
does indeed have polynomial size proofs of the graph tautologies GGT and 

1 Huang and Yu [16] also gave a separation of regular resolution and general resolution, 
but only for a single set of clauses. Goerdt [14] gave a quasipolynomial separation of 
regular resolution and general resolution. 
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the pebbling tautologies GPeb fc ®. Our refutations avoid the use of extra- 
neous variables in the style of proof trace extensions; furthermore, they use 
only the traditional resolution rule and do not require degenerate resolution 
inferences or w-resolution inferences. In addition, we use only learning of 
input clauses; thus, our refutations are also regWRTI proofs (and in fact 
regRTI proofs) in the terminology of |12| . As a corollary of the characteri- 
zation of regWRTI by [12], the GGT principles and the GPeb fc ® principles 
have polynomial size refutations that can be found by a DPLL algorithm 
with clause learning and without restarts (under the appropriate variable 
selection order). 

The Stone principles have recently been shown to also have regRTI refu- 
tations by the second author and L. Kolodziejczyk [TO]; however, their proof 
uses a rather different method than we use below. Thus, none of the three 
principles separate clause learning DPLL from full resolution. It is natural 
to speculate that perhaps pool resolution or regWRTI can simulate general 
resolution, or that DPLL with clause learning and without restarts can sim- 
ulate general resolution. It is far from clear that this is true, but, if so, our 
methods and those of |10] may represent a step in this direction. 

The outline of the paper is as follows. Section [2] begins with the defini- 
tions of resolution, degenerate resolution, and w-resolution, and then regu- 
lar, tree, and pool resolution. After that, we define the graph tautologies 
GT n and the guarded versions GGT n , and state the main theorems about 
proofs of the GGT n principles. Section [3] gives the proof of these theorems. 
Several ingredients are needed for the proof. The first idea is to try to fol- 
low the regular refutations of the graph tautology clauses GT n as given by 
Stalmarck [21] and Bonet and Galesi [!), : however, these refutations cannot 
be used directly since the transitivity clauses of GT n are "guarded" in the 
GGT n clauses and this yields refutations which violate the regularity/pool 
property. So, the second idea is that the proof search process branches as 
needed to learn transitivity clauses. This generates additional clauses that 
must be proved: to handle these, we develop a notion of "bipartite partial 
order" and show that the refutations of [21[ |9] can still be used in the pres- 
ence of a bipartite partial order. The tricky part is to be sure that exactly 
the right set of clauses is derived by each subproof. Some straightforward 
bookkeeping shows that the resulting proof is polynomial size. 

Section|J]discusses how to modify the refutations constructed in Section[3] 
so that they are "greedy" and "unit-propagating". These conditions means 
that proofs cannot ignore contradictions, nor contradictions that can be 
obtained by unit propagation. The greedy and unit-propagating conditions 
correspond well to actual implemented DPLL proof search algorithms, since 
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they backtrack whenever a contradiction can be found by unit propagation. 
Section 2] concludes with an explicit description of a polynomial time DPLL 
clause learning algorithm for the GGT n clauses. 

Section [5] gives the pool resolution and regRTI refutations of the GPeb fc ® 
principles. The proof mimics to a certain extent the methods of Section El 
but must also deal with the complications of xor-ification. 

This paper is an expansion of an extended abstract [7] and an unpub- 
lished preprint [8] by the first two authors. These earlier versions included 
only the results for the GGT tautologies and did not consider the GPeb 
principles. 

We are grateful to J. Hoffmann for assisting with a correction to an 
earlier version of the proof of Theorem (2) We also thank A. Van Gelder, 
A. Beckmann, and T. Pitassi for encouragement, suggestions, and useful 
comments. 

2 Preliminaries and guarded graph tautologies 

Propositional formulas are defined over a set of variables and the connectives 
A, V and We use the notation x to express the negation —ix of x. A literal 
is either a variable x or a negated variable x. A clause C is a set of literals, 
interpreted as the disjunction of its members. The empty clause, □, has 
truth value False. We shall only use formulas in conjunctive normal form, 
CNF; namely, a formula will be a set (conjunction) of clauses. We often use 
disjunction (V), union (U), and comma (,) interchangeably. 

Definition The various forms of resolution take two clauses A and B called 
the premises and a literal x called the resolution variable, and produce a new 
clause C called the resolvent. 

_A B_ 

C 

In all cases below, it is required that x £ A and x £ B. The different forms 
of resolution are: 

Resolution rule. The hypotheses have the forms A := A' V x and B := 
B' V x. The resolvent C is A' V B' . 

Degenerate resolution rule. |15|. [23] If x € A and x E B, we apply the 
resolution rule to obtain C. If A contains x, and B doesn't contain x, 
then the resolvent C is B. If A doesn't contain x, and B contains x, 
then the resolvent C is A. If neither A nor B contains the literal x 
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or x, then C is the lesser of A or 5 according to some tiebreaking 
ordering of clauses. 

w-resolution rule. [12] From A and B as above, we infer C := (A \ {x}) V 
(B \ {x}). If the literal x ^ A (resp., x £ B), then it is called a 
phantom literal of A (resp., B). 

Definition A resolution derivation, or proof, of a clause C from a CNF 
formula F is a sequence of clauses C\, . . . , C s such that C = C s and such 
that each clause from the sequence is either a clause from F or is the resolvent 
of two previous clauses. If the derived clause, C s , is the empty clause, this is 
called a resolution refutation of F. The more general systems of degenerate 
and w-resolution refutations are defined similarly. 

We represent a derivation as a directed acyclic graph (dag) on the vertices 
Ci, . . . ,C S , where each clause from F has out-degree 0, and all the other 
vertices from C\, . . . , C s have edges pointing to the two clauses from which 
they were derived. The empty clause has in-degree 0. We use the terms 
"proof" and "derivation" interchangeably. 

Resolution is sound and complete in the refutational sense: a CNF for- 
mula F has a refutation if and only if F is unsatisfiable, that is, if and only 
if —iF is a tautology. Furthermore, if there is a derivation of a clause C 
from F, then C is a consequence of F; that is, for every truth assignment a, 
if a satisfies F then it satisfies C. Conversely, if C is a consequence of F 
then there is a derivation of some C C C from F. 

A resolution refutation is regular provided that, along any path in the di- 
rected acyclic graph, each variable is resolved on at most once. A resolution 
derivation of a clause C is regular provided that, in addition, no variable 
appearing in C is used as a resolution variable in the derivation. A refuta- 
tion is tree-like if the underlying graph is a tree; that is, each occurrence of 
a clause occurring in the refutation is used at most once as a premise of an 
inference. 

We next define a version of pool resolution, using the conventions of |12j 
who called this "tree-like regular resolution with lemmas" or "regRTL" . The 
idea is that clauses obtained previously in the proof can be used freely as 
learned lemmas. To be able to talk about clauses previously obtained, we 
need to define an ordering of clauses. 

Definition Given a tree T, the postorder ordering <t of the nodes is defined 
as follows: if u is a node of T, v is a node in the subtree rooted at the left 
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child of u, and w is a node in the subtree rooted at the right child of u, then 

V <T W <T U. 

Definition A pool resolution proof (also called a regRTL proof) from a 
set of initial clauses F is a resolution proof tree T that fulfills the following 
conditions: (a) each leaf is labeled with either a clause of F or a clause 
(called a "lemma") that appears earlier in the tree in the <t ordering; 
(b) each internal node is labeled with a clause and a literal, and the clause 
is obtained by resolution from the clauses labeling the node's children by 
resolving on the given literal; (c) the proof tree is regular; (d) the roof is 
labeled with the conclusion clause. If the labeling of the root is the empty 
clause □, the pool resolution proof is a pool refutation. 

The notions of degenerate pool resolution proof and pool w-resolution 
proof are defined similarly, but allowing degenerate resolution or w-resolution 
inferences, respectively. The two papers (23J [15] defined pool resolution to 
be the degenerate pool resolution system, so our notion of pool resolution is 
more restrictive than theirs. Our definition is equivalent to the one in 
however. It is also equivalent to the system regRTL defined in |12j . Pool 
w-resolution is the same as the system regWRTL of [12J. 

A "lemma" in clause (a) of the above definition is called an input lemma 
if it is derived by input subderivation, namely by a subderivation in which 
each inference has at least one hypothesis which is a member of F or is a 
lemma. The notion of input lemma was first introduced by [12J. In their 
terminology, a pool resolution proof which uses only input lemmas, is called 
a regRTI proof. Likewise a regWRTL proof that uses only input lemmas is 
called a regWRTI proof. 

To understand the nomenclature; "reg" stands for "regular" , "W" for "w- 
resolution", "RT" for "resolution tree", "L" for lemma, and "I" for "input 
lemma" . 

Next we define various graph tautologies, sometimes also called "ordering 
principles". They will all use a size parameter n > 1, and variables Xij with 
i,j G [n] and i ^ j, where [n] = {0, 1, 2, . . . , n— 1}. A variable x^j will 
intuitively represent the condition that i -< j with -< intended to be a total, 
linear order. We will thus always adopt the simplifying convention that Xij 
and the identical literal, i.e., only the variables Xij for i < j actually 

exist, and Xj t i for j < i is just a notation for x~i 7 j, and Xj^ stands for x^j. 
This identification makes no essential difference to the complexity of proofs 
of the tautologies, but it reduces the number of literals and clauses, and 
simplifies the definitions. In particular, it means there are no axioms for the 
antisymmetry or totality of -<. 
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The following principle is based on the tautologies defined by Krishna- 
murthy [18]. These tautologies, or similar ones, have also been studied by 

[2D 13 El Uracil]. 

Definition Let n > 1. Then GT n is the following set of clauses involving 
the variables Xij, for i,j€ [n] with i ^ j. 

(00) The clauses Vjy? x i,i-> eacn value i < n. 

(70) The transitivity clauses & := Xij V~Xj,k Vi^ for all distinct i,j, k 
in [n]. 

Note that the clauses T^j^, Tj & j and Tj^jj are identical. For this reason 
Van Gelder [23] uses the name "no triangles" (NT) for a similar principle. 

The next definition is from [lj, who used the notation GT^. They 
used particular functions r and s for their lower bound proof, but since 
our upper bound proof does not depend on the details of r and s we leave 
them unspecified. We require that r(i,j,k) ^ s(i,j,k) and that the set 
{r(i,j,k),s(i,j,k)} £ {i,j,k}. In addition, w.l.o.g., r(i,j,k) = r(j,k,i) = 
r(k,i,j), and similarly for s. 

Definition Let n > 1, and let r(i,j,k) and s(i,j,k) be functions mapping 
[n] 3 — > [n] as above. The guarded graph tautology formula GGT n consists of 
the following clauses: 

(ag) The clauses Vjy? x i,i-> f° r eacn value i < n. 

(7^) The guarded transitivity clauses T%^^ V x T)S and T%j,k V ^r,s> for all 
distinct i, j, k in [n], where r = r(i,j, k) and s = s(i,j,k). 

Note that the GGT n clauses depend on the functions r and s; this is sup- 
pressed in the notation. Our main result for the guarded graph tautologies 
is: 

Theorem 1 The guarded graph tautology formulas GGT n have polynomial 
size pool (regRTL) resolution refutations. 

The proof of Theorem Q] will construct pool refutations in the form of 
regular tree-like refutations with lemmas. A key part of this is learning 
transitive closure clauses that are derived using resolution on the guarded 
transitivity clauses of GGT n . A slightly modified construction, that uses a 
result from [12], gives instead tree-like regular resolution refutations with 
input lemmas. This will establish the following: 
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Theorem 2 The guarded graph tautology formulas GGT n have polynomial 
size, tree-like regular resolution refutations with input lemmas (regRTI refu- 
tations). 

A consequence of Theorem is that the GGT n clauses can be shown unsat- 
isfiable by non-greedy polynomial size DPLL searches using clause learning. 
This follows via Theorem 5.6 of [12], since the refutations of GGT n are 
regRTI, and hence regWRTI, proofs in the sense of |12j . 

However, as shown by Theorem [6] in Section 01 we can improve the 
constructions of Theorems [1] and [2] to show that the GGT n principles can be 
refuted also by greedy and unit-propagating polynomial size DPLL searches 
with clause learning. 

3 Guarded graph tautology refutations 

The following theorem is an important ingredient of our upper bound proof. 

Theorem 3 (Stalmarck [21]; Bonet-Galesi [9]; Van Gelder [21]) The sets 
GT n have regular resolution refutations P n of polynomial size 0(n 3 ). 

We do not include a direct proof of Theorem [3] here, which can be found 
in [21], [9J [23] . The present paper uses the proofs P n as a "black box" ; the only 
property needed is that the P n 's are regular and polynomial size. Lemma [4] 
below is a direct generalization to Theorem [3] in fact, when specialized to 
the case of ir = 0, it is identical to Theorem El 

The refutations P n can be modified to give refutations of GGT n by 
first deriving each transitive clause Tj j from the two guarded transitivity 
clauses of (70). This however destroys the regularity property, and in fact 
no polynomial size regular refutations exist for GGT n [I]. 

As usual, a partial order on [n] is an antisymmetric, transitive relation bi- 
nary relation on [n]. We shall be mostly interested in "partial specifications" 
of partial orders: partial specifications are not required to be transitive. 

Definition A partial specification, r, of a partial order is a set of ordered 
pairs r C [n] x [n] which are consistent with some (partial) order. The 
minimal partial order containing r is the transitive closure of r. We write 
i -< T j to denote € r, and write i -<* j to denote that (i,j) is in the 
transitive closure of r. 

The T-minimal elements are the i's such that j -< T i does not hold for 
any j. 
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Figure 1: Example of a partial specification of a partial order (left) and the 
associated bipartite partial order (right). 

We will be primarily interested in particular kinds of partial orders, 
called "bipartite" partial orders, that can be associated with partial orders. 
A bipartite partial order is a partial order that does not have any chain of 
inequalities x <y < z. 

Definition A bipartite partial order is a binary relation tt on [n] such that 
the domain and range of tt do not intersect. The set of 7r-minimal elements 
is denoted M n . 

The righthand side of Figure [T] shows an example. The bipartiteness of tt 
arises from the fact that and [n] \ M n partition [n] into two sets. Note 
that if i -< n j, then i € M v and j £ M n . In addition, M n contains the 
isolated points of tt. 

Definition Let r be a specification of a partial order. The bipartite partial 
order tt that is associated with r is defined by letting i -< n j hold for precisely 
those i and j such that i is r-minimal and i -<* j. 

It is easy to check that the tt associated with r is in fact a bipartite partial 
order. The intuition is that tt retains only the information about whether 
i -<* j for minimal elements i, and forgets the ordering that r imposes 
on non-minimal elements. Figure [T] shows an example of how to obtain a 
bipartite partial order from a partial specification. 

We define the graph tautology GT T)n relative to tt as follows. 

Definition Let tt be a bipartite partial order on [n]. Then GT^ is the set 
of clauses containing: 

(a) The clauses V?yi x j,i-> f° r eac h value i G M n . 

(/3) The transitivity clauses Tij^ := Xij Vxj,k Vx^j for all distinct i,j,k 
in M^. (Vertices i,j,k' in Figure [2] show an example.) 
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[n] \ M v : 




Figure 2: A bipartite partial order ir is pictured, with the ordered pairs of ir 
shown as directed edges. (For instance, j -< n k holds.) The set M v is the set 
of minimal vertices. The nodes i,j,k shown are an example of nodes used 
for a transitivity axiom Xij Vxj^ Vi^j of type (7). The nodes i, j, k' are an 
example of the nodes for a transitivity axiom of type {(5). 

(7) The transitivity clauses T^j ^ for all distinct k such that i,j € M v 
and i 7^ k and j -< n k. (As shown in Figure [2j) 

The set GT^ )n is satisfiable if it is nonempty. As an example, there is 
the assignment that sets Xji true for some fixed j £ and every i € M v , 
and sets all other variables false. However, if tt is applied as a restriction, 
then GT w n becomes unsatisfiable. That is to say, there is no assignment 
which satisfies GT n<n and is consistent with ir. This fact is proved by the 
regular derivation P n described in the next lemma. 

Definition For tt a bipartite partial order, the clause is defined by 



Lemma 4 Let tt be a bipartite partial order on [n] . Then there is a regular 
derivation P v of (\Jtt) from the set GT,^. 

The only variables resolved on in P n are the following: the variables Xij 
such that i,j £ M n , and the variables a^.k such that k M W! i € M n , and 
z 7^77 k . 

Lemma 0] implies that if ir is the bipartite partial order associated with 
a partial specification r of a partial order, then the derivation P n does not 
resolve on any literal whose value is set by r. This is proved by noting that 
if i < T j, then j ^ M n . 

Note that if tt is empty, M n = [n] and there are no clauses of type (7). 
In this case, GT^ is identical to GT n , and P n is the same as the refutation 
of GT n of Theorem [3l 

Proof By renumbering the vertices, we can assume w.l.o.g. that = 
{0, . . . , m— 1}. For each k > m, there is at least one value of j such that 
j k: let Jfc be an arbitrary such value j. Note < m. 
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Fix i G M n ; that is, i < m. Recall that the clause of type (a) in GTV^, 
for i is \J We resolve this clause successively, for each k >m such 
that i y^ n k, against the clauses Tij k ^ of type (7) 

%i,J k Vi4,i; Vx fcii 

using resolution variables Xk,i- (Note that Jk 7^ i since i 7^ k.) This yields 
a clause 3^ m : 

V ^> J fc V V ^ J fc' fc V V Xfc >* V V Xfc >*- 

fc>m fc>m fc>m fc<m 

i-fink iy^nk i^-n-fc fc^i 

The first two disjuncts shown above for T-' come from the side literals of 
the clauses T^j^; the last two disjuncts come from the literals in V?yi 
which were not resolved on. Since a literal 3^,j fc is the same literal as xj fe ,i 
and since Jk < m, the literals in the first disjunct are also contained in the 
fourth disjunct. Thus, eliminating duplicate literals, T' im is equal to the 
clause 

V Wj >" k V V Xk > i V V Xk ' i ' 

k>m k>m k<m 

i-Airk i^i n k k^i 

Repeating this process, we obtain derivations of the clauses T' im for all 
i < m. The final disjuncts of these clauses, Vi^fc<m x k,i, are the same as the 
(o£q) clauses in GT m . Thus, the clauses T[ m give all (ag) clauses of GT m , 
but with literals xj k; k and Xk,i added in as side literals. Moreover, the clauses 
of type (0) in GT n n are exactly the transitivity clauses of GT m . All these 
clauses can be combined exactly as in the refutation of GT m described in 
Theorem [3j but carrying along extra side literals Xj kt k and Xk,i, or equiva- 
lently carrying along literals xj kt k for Jk -< n k, and x^k for i ^ n k. Since 
the refutation of GT m uses all of its transitivity clauses and since each xj k 
literal is also one of the x^'s, this yields a resolution derivation P n of the 
clause 

{ x i,k '■ i ^7r k}. 

This is the clause (\/W) as desired. 

Finally, we observe that P n is regular. To show this, note that the first 
parts of P n deriving the clauses T- m are regular by construction, and they 
use resolution only on variables xt y i with k > m, i < m, and i k. The 
remaining part of P n is also regular by Theorem [31 and uses resolution only 
on variables Xij with i,j<m. □ 
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Proof of Theorem [TJ We will show how to construct a series of "LR partial 
refutations", denoted Rq, R\, i?2 ; ---; this process eventually terminates 
with a pool (regRTL) resolution refutation of GGT n . The terminology "LR 
partial" indicates that the refutation is being constructed in left-to-right 
order, with the left part of the refutation properly formed, but with many 
of the remaining leaves being labeled with bipartite partial orders instead 
of with valid learned clauses or initial clauses from GGT n . We first describe 
the construction of the pool refutation, and leave the size analysis to the 
end. 

An LR partial refutation R is a tree with nodes labeled with clauses 
that form a correct pool resolution proof, except possibly at the leaves. 
Furthermore, it must satisfy the following conditions. 

a. R is a tree. The root is labeled with the empty clause. Each non-leaf 

node in R has a left child and right child; the clause labeling the node 
is derived by resolution from the clauses on its two children. 

b. For each clause C occurring in R, the clause C + and the set of ordered 

pairs t(C) are defined by 

C + := {xij : Xij occurs in some clause on the branch 

from the root node of R up to and including C}, 

and t(C) = : x i:j € C + }. Note that C C C + holds by definition. 

In many cases, r(C) will be a partial specification of a partial order, 
but this is not always true. For instance, if C is a transitivity axiom, 
then t(C) has a 3-cycle and is not consistent as a specification of a 
partial order. 

d. Leaves are either "finished" or "unfinished". Each finished leaf L is la- 

beled with either a clause from GGT n or a clause that occurs to the 
left of L in the postorder traversal of R. 

e. For an unfinished leaf labeled with clause C, the set r(C) is a partial 

specification of a partial order. Furthermore, letting tt be the bipartite 
partial order associated with r(C), the clause C is equal to (V^)- 

Property e. is particularly crucial and is novel to our construction. As 
shown below, each unfinished leaf, labeled with a clause C = (\Jtt), will be 
replaced by a derivation S. The derivation S often will be based on P n , and 
thus might be expected to end with exactly the clause C; however, some of 
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the resolution inferences needed for P n might be disallowed by the regularity 
property of pool resolution proofs. This can mean that S will instead be a 
derivation of a clause C such that CCC'C C + . The condition C Q C + 
is required because any literal x £ C \ C will be handled by modifying 
the refutation R by propagating x downward in R until reaching a clause 
that already contains x. The condition C C C + ensures that such a clause 
exists. The fact that C" D C will mean that enough literals are present 
for the derivation to use only (non-degenerate) resolution inferences — by 
virtue of the fact that our constructions will pick C so that it contains the 
literals that must be present for use as resolution literals. The extra literals 
in C'\C will be handled by propagating them down the proof to where they 
are resolved on. 

The construction begins by letting R be the "empty" refutation, con- 
taining just the empty clause. Of course, this clause is an unfinished leaf, 
and r(0) = 0. Thus Rq is a valid LR partial refutation. 

For the induction step, Ri has been constructed already. Let C be the 
leftmost unfinished clause in Ri. will be formed by replacing C by a 

refutation S of some clause C such that CCC'C C + . 

We need to describe the (LR partial) refutation S. Let ir be the bipartite 
partial order associated with t(C), and consider the derivation P n from 
Lemma HI Since C is (\Jtt) by condition e., the final line of P n is the 
clause C. The intuition is that we would like to let S be P n . The first 
difficulty with this is that P^ is dag-like, and the Li?-refutation is intended to 
be tree-like, This difficulty, however, can be circumvented by just expanding 
P n , which is regular, into a tree- like regular derivation with lemmas by the 
simple expedient of using a depth- first traversal of P^. The second, and 
more serious, difficulty is that P v is a derivation from GT n , not GGT n . 
Namely, the derivation P^ uses the transitivity clauses of GT n as initial 
clauses instead of the guarded transitivity clauses of GGT n . The transitivity 
clauses Tij t k := Xij VSj^Vx^ in P n are handled one at a time as described 
below. We will use four separate constructions: in case (i), no change to P n 
is required; cases (ii) and (iii) require small changes; and in the fourth case, 
the subproof P v is abandoned in favor of "learning" the transitivity clause. 

By the remark made after Lemma HI no literal in C + is used as a reso- 
lution literal in P n . 

(i) If a transitivity clause j ^ of P n already appears earlier in Ri (that 
is, to the left of C), then it is already learned, and can be used freely 
in P w . 

In the remaining cases (ii)-(iv), the transitivity clause Tij k is not yet learned. 
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Let the guard variable for be x r>s , so r = r(i,j, k) and s = s(i,j, k). 

(ii) Suppose case (i) does not apply and that the guard variable x y g or its 
negation x rs is a member of C + . The guard variable thus is used 
as a resolution variable somewhere along the branch from the root to 
clause C. Then, as mentioned above, Lemma [4] implies that x T)S is not 
resolved on in P n . Therefore, we can add the literal x f g or x j> g (respec- 
tively) to the clause T^j^ and to every clause on any path below Tjj,* 
until reaching a clause that already contains that literal. This replaces 
Tij fi with one of the initial clauses T^j & V x r>s or If V x T)S of GGT n . 
By construction, it preserves the validity of the resolution inferences 
of R. L as well as the regularity property. Note this adds the literal x r>s 
or x rs to the final clause C of the modified P n . This maintains the 
property that C C C C C + . 

(Hi) Suppose case (i) does not apply and that x r ^ s is not used as a resolution 
variable anywhere below 1$ ^ in P n and is not a member of C + . In this 
case, P n is modified so as to derive the clause T^j^ from the two GGT n 
clauses Tj^fc V x TjS and Ti j k Vx r)S by resolving on x r>s . This maintains 
the regularity of the derivation. It also means that henceforth Tjj fc 
will be learned. 

If all of the transitivity clauses in P^ can be handled by cases then 
we use P n to define Ri+\. Namely, let P^ be the derivation P n as modified 
by the applications of cases (ii) and (Hi). The derivation P^ is regular and 
dag-like, so we can recast it as a tree-like derivation S with lemmas, by using 
a depth- first traversal of P' n . The size of S is linear in the size of P^, since 
only input lemmas need to be repeated. The final line of S is the clause C , 
namely C plus the literals introduced by case (ii). The derivation Ri+\ 
is formed from R{ by replacing the clause C with the derivation S of C , 
and then propagating each new literal x € C \ C down towards the root 
of Ri, adding x to each clause below S until reaching a clause that already 
contains x. The derivation S contains no unfinished leaf, so Ri+i contains 
one fewer unfinished leaves than Ri. 

On the other hand, if even one transitivity axiom Tj ^ in P^ is not 
covered by the above three cases, then case (iv) must be used instead. This 
introduces a completely different construction to form S: 

(iv) Let Tijk be any transitivity axiom in P v that is not covered by cases 
(i)-(iii). In this case, the guard variable X ip § IS used resolution 
variable in P n somewhere below Tjj in general, this means we can- 
not use resolution on x r ^ s to derive Tjj & while maintaining the desired 
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pool property. Hence, P n is no longer used, and we instead will form 
S with a short left-branching path that "learns" j ^. This will gen- 
erate two or three new unfinished leaf nodes. Since unfinished leaf 
nodes in a LR partial derivation must be labeled with clauses from 
bipartite partial orders, it is also necessary to attach short derivations 
to these unfinished leaf nodes to make the unfinished leaf clauses of S 
correspond correctly to bipartite partial orders. These unfinished leaf 
nodes are then kept in Ri+i to be handled at later stages. 

There are separate constructions depending on whether is a 

clause of type (/3) or (7); details are given below. 

First suppose Tij t k is of type (7), and thus Xj t k appears in C. (Refer to 
Figure [2j) Let x rjS be the guard variable for the transitivity axiom ly^. 
The derivation S will have the form 



■> %j,ki x k,ii x r,s > %j,kj x k,ii x r,s 

x i,ji x j,ki %k,i x i,ji x i,ki ^ —[jk;jR{i)\ ^' 

x i,ji x j,ki 7r — [jk;jR(i)] x j,ii x j,ki ^ — [jk;iR(j)] 

The notation T?-Uk] denotes the disjunction of the negations of the literals 
in 7r omitting the literal Xj^- We write u iR(J)" to indicate literals Xi/ such 
that j ^ n t. (The "-R(j)" means "range of j".) Thus K-[jk;iR(j)] denotes the 
clause containing the negations of the literals in it, omitting Xj t k and any 
literals x^ such that j -< % £. The clause n-\jk;jR(i)] * s defined similarly. 

The upper leftmost inference of S is a resolution inference on the vari- 
able x r>s . Since ^ is not covered by either case (i) or (ii), the variable x r>s 
is not in C + . Thus, this use of resolution variable does not violate 

regularity. Furthermore, since is of type (7), we have i-fi T tc)ji 3^t{C)^i 
i-fi T (C)k, and k-fi T ic)i- Thus the literals Xij and x^k are not in C + , so they 
also can be resolved on without violating regularity. 

Let C\ and C2 be the final clauses of S± and S2, and let C{~ be the 
clause below C\ and above C. The set TiC?) is obtained by adding (j, i) 
to r(C), and similarly T(Cj~) is r(C) plus Since T^j^ is type (7), 

we have i,j E M n . Therefore, since t(C) is a partial specification of a 
partial order, t(C*2) and r(C ] ~) are also both partial specifications of partial 
orders. Let 7T2 and ni be the bipartite orders associated with these two 
partial specifications (respectively). We will form the subproof Si so that 
it contains the clause (V^i) as its only unfinished clause. This will require 
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adding inferences in Si which add and remove the appropriate literals. The 
first step of this type already occurs in going up from C-f to C\ since this 
has removed Xj t k and added Wik, reflecting the fact that j is not 7Ti-minimal 
and thus & £ 7ri but Xj^ £ 7Ti. Similarly, we will form 52 so that its only 
unfinished clause is (V^)- 

We first describe the subproof 52 of 5. The situation is pictured in 
Figure El which shows an extract from Figure [2j the edges shown in part (a) 
of the figure correspond to the literals present in the final line Ci of 52- In 
particular, recall that the literals Xi/ such that j -< n t are omitted from 
the last line of 52- (Correspondingly, the edge from i to ii is omitted from 
Figure El) The last line Ci of 52 may not correspond to a bipartite partial 
order as it may not partition [n] into minimal and non-minimal elements; 
thus, the last line of 52 may not qualify to be an unfinished node of Ri+i- 
(An example of this in FigureE^a) is that j -< T (c 2 ) * ^t(c* 2 ) ^2, corresponding 
to Wjj and Xi t e 2 being in the last line of 52.) The bipartite partial order 7T2 
associated with t(C2) is equal to the bipartite partial order that agrees 
with tt except that each i ~< n £ condition is replaced with the condition 
j -< n2 £■ (This is represented in Figure E^b) by the fact that the edge from 
i to £2 has been replaced by the edge from j to £2- Note that the vertex i is 
no longer a minimal element of tt2] that is, i ^ M n2 .) We wish to form 52 to 
be a regular derivation of the clause ^j,i^-\jk;iR(j)] from the clause (V^)- 

The subproof of 52 for replacing Xi/ 2 in 7f with Xj/ 2 in If 2 is as follows, 
letting W* be ^-[jk;iR(j);it 2 y 

S'2 ■ . : . • • . : . • rest of 52 

%j,ki -^1,1.21 ^ 

The part labeled "rest of 52" will handle similarly the other literals £ such 
that i -< n £ and j I. The final line of 5 2 is the transitivity axiom Tj^^ 2 . 
This is a GT n axiom, not a GGT„ axiom; however, it can be handled by 
the methods of cases Namely, if Tj^^ 2 has already been learned by 

appearing somewhere to the left in Ri, then S' 2 is just this single clause. 
Otherwise, let the guard variable for Tj ^ 2 be av iS '. If used as a 

resolution variable below Tj^^ 2 , then replace Tj s i s £ 3 with Tj^^ 2 V x r > :S i or 
Tj^ 2 V v jS ', and propagate the av,s' or x r i <s i to clauses down the branch 
leading to Tj^^ 2 until reaching a clause that already contains that literal. 
Finally, if x r > )S i has not been used as a resolution variable in Ri below C, 
then let S' 2 consist of a resolution inference deriving (and learning) Tj n 2 
from the clauses Tj^^x r i a > and Tj i£ 2 ,x r t s r. 
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Figure 3: The partial orders for the fragment of 52 shown in ([I]). 

To complete the construction of S2, the inference (pQ) is repeated for 
each value of £ such that i <^ £ and j -fi n I. The result is that S2 has one 
unfinished leaf clause, and it is labeled with the clause (V^)- 

We next describe the subproof S\ of S. The situation is shown in Fig- 
ure HI As in the formation of S2 , the final clause C\ in S\ may need to be 
modified in order to correspond to the bipartite partial order tti which is 
associated with r(Ci). First, note that the literal Xj t k is already replaced 
by Wik in the final clause of Si. The other change that is needed is that, for 
every t such that j -< n £ and i -fi n £, we must replace xj j with x^i since we 
have j 7 ^ 7ri £ and i -< ni £. Vertex £3 in Figure 0] is an example of a such a 
value £. The ordering in the final clause of Si is shown in part (a), and the 
desired ordered pairs of tt\ are shown in part (b). Note that j is no longer 
a minimal element in 7Ti. 

The replacement of Xj t e 3 with Xi t e 3 is effected by the following inference, 
letting 7f* now be ^-\jk;jR{i);jh\- 

Si • . ; . ■ ■ . : . ■ rest of Si 

•^i,ki *^i/3) ^ 

The "rest of Si" will handle similarly the other literals £ such that j -< n £ and 
i £. Note that the final clause of S[ is the transitivity axiom Tiji 3 . The 
subproof S[ is formed in exactly the same way that S' 2 was formed above. 
Namely, depending on the status of the guard variable x r i >s > for Tij^, one 
of the following is done: (i) the clause ?ij^ 3 is already learned and can be 
used as is, or {it) one of x r i s i or x r > s t is added to the clause and propagated 
down the proof, or {Hi) the clause Tij^ 3 is inferred using resolution on x r ' jS > 
and becomes learned. 

To complete the construction of Si, the inference is repeated for 
each value of £ such that j ^ £ and i -fi n £. The result is that Si has one 
unfinished leaf clause, and it corresponds to the bipartite partial order m. 

That completes the construction of the subproof S for the subcase of {iv) 
where T^j^ is of type (7). Now suppose T^j^ is of type (For instance, 
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(a.) X\^k , Xjjig j x i,j j 7T 



Figure 4: The partial orders for the fragment of Si shown in ([2]). 



the values i,j,k' of Figure [2j) In this case the derivation S will have the 
form 



^ifcj",* x i,ji x i,k, 71 '-[jR(i),kR(iUj)\ 

x i,ji x j,ki 7r -[j_R(i),fci?(iUi)] 

W 

where x rjS is the guard variable for T^j^. We write [7f-[jjj(infe)]] to mean the 
negations of literals in ir omitting any literal Xj£ such that both % £ and 
k -< n I. Similarly, ^-URU) t kR(iuj)] indicates the negations of literals in ir, 
omitting the literals Xji such that i -< n £ and the literals x~k,i such that 
either i -< n £ or j I. 

Note that the resolution on x rjS used to derive Tij^ does not violate reg- 
ularity, since otherwise Tijk would have been covered by case (ii). Likewise, 
the resolutions and Xj t k do not violate regularity since Tj is 

of type 08). 

The subproof S$ is formed exactly like the subproof S2 above, with 
the exception that now the literal xa^ is not present. Thus we omit the 
description of S5. 

We next describe the construction of the subproof $4. Let C4 be the 
final clause of £4; it is easy to check that t(C±) is a partial specification of 
a partial order. As before, we must derive C4 from the clause (V^) where 
7T4 is the bipartite partial order associated with the partial order t{C±). A 
typical situation is shown in Figure [5j As pictured there, it is necessary 
to add the literals Xi t £ such that j -< n £ and i -fi n £, while removing Xj,e; 
examples of this are £ equal to £2 and £3 in Figure [5j At the same time, we 
must add the literals Xk,e such that j ^ n £ and k -fi n £, while removing Xj i; 
examples of this are £ equal to £\ and, again, £2 in the same figure. 

For a vertex £3 such that j -< n £3 and k -< n £3 but i -fi n £3, this is done 



S 4 -. : .• 

x i,jt x k,ji 7T— \jR(ink)] 
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Figure 5: The partial orders as changed by S4. 

similarly to the inferences ([U and ([2]) but without the side literal Xj^: 
5*4 • . : . • • . : . • rest of £4 

Here 7f* is 7T-Ui?(jn&w 3 ]- The transitivity axiom Iij^ 3 shown as the last 
line of S4 is handled exactly as before. This construction is repeated for all 
such £ 3 's. 

The vertices £\ such that j -< n l\ and j -< w t\ but A; 7^ l\ are handled 
in exactly the same way. (The side literals tt* change each time to reflect 
the literals that have already been replaced.) 

Finally, consider a vertex £2 such that i 7^ £2 and j -< n £2 and k -fi n £2- 
This is handled by the derivation 

S'l' • . : . • • . : . ■ rest of S4 

■^k,ji %j,l2 » %£2,k j ^i,£2 >-^k,ji ^k,i2 j 

As before, the set 7r* of side literals is changed to reflect the literals that 
have already been added and removed as S4 is being created. The subproofs 
S'l and S'l' of the transitivity axioms Tij^ 2 and Tkj/ 2 are handled exactly 
as before, depending on the status of their guard variables. 

Finally, we describe how to form the subproof S3 . For this, we must 
form the bipartite partial order ir^ which is associated with the partial or- 
der r(Cs), where C3 is the final clause of S3. To obtain 7F3, we need to add 
the literals x^ such that i £ and such that either j -< w £ or k -<„■ £, 
while removing any literals Xj t £ and x~k,t- This is done by exactly the same 
construction used above in ([3]). The literals in Tf-[jR(i)-kR(iuj)] are exactly 
the literals needed to carry this out. The construction is quite similar to the 
above constructions, and we omit any further description. 



on 
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That completes the description of how to construct the LR partial refu- 
tations Ri. The process stops once some Ri has no unfinished clauses. We 
claim that the process stops after polynomially many stages. 

To prove this, recall that H4+1 is formed by handling the leftmost unfin- 
ished clause using one of cases (i)-(iv). In the first three cases, the unfinished 
clause is replaced by a derivation based on P n for some bipartite order tt. 
Since P^ has size 0(n 3 ), this means that the number of clauses in 
is at most the number of clauses in Ri plus 0(n 3 ). Also, by construction, 
Ri+l has one fewer unfinished clauses than R^. In case (iv) however, Ri + \ 
is formed by adding up to 0(n) many clauses to Ri plus adding either two 
or three new unfinished leaf clauses. In addition, case (iv) always causes 
at least one transitivity axiom T%j^ to be learned. Therefore, case (iv) can 
occur at most 2Q) = 0(n 3 ) times. Consequently at most 3 • 2(g) = 0(n 3 ) 
many unfinished clauses are added throughout the entire process. It fol- 
lows that the process stops with Ri having no unfinished clauses for some 
i < 6(3) = 0(n 3 ). Therefore there is a pool refutation of GGT n with 0(n e ) 
lines. Since the GGT n principle has 0(n 3 ) many clauses, the number of 
inferences in the refutation is bounded by a quadratic polynomial of the 
number of the clauses being refuted. 

By inspection, each clause in the refutation contains 0(n 2 ) literals. This 
is because the largest clauses are those corresponding to (small modifications 
of) bipartite partial orders, and because bipartite partial orders can contain 
at most 0(n 2 ) many ordered pairs. Furthermore, the refutations P n for the 
graph tautology GT n contain only clauses of size 0(n 2 ). 
Q.E.D. Theorem □ □ 

Theorem [2] is proved with nearly the same construction. In fact, the 
only change needed is the construction of S from P^. Recall that in the 
proof of Theorem [IJ the pool derivation S was formed by using a depth- 
first traversal of P^. This is not sufficient for Theorem [2j since now the 
derivation S must use only input lemmas. Instead, we use Theorem 3.3 
of [12], which states that a (regular) dag- like resolution derivation can be 
transformed into a (regular) tree-like derivation with input lemmas. Forming 
S in this way from P^ suffices for the proof of Theorem [2) the lemmas of S 
are either transitive closure axioms derived earlier in Ri or are derived by 
input subproofs earlier in the post-ordering of S. Since the transitive closure 
axioms that appeared earlier in Ri were derived by resolving two GGT n 
axioms, the lemmas used in S are all input lemmas. 

The transformation of Theorem 3.3 of [12] may multiply the size of the 
derivation by the depth of the original derivation. Since it is possible to form 
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the proofs P v with depth O(n), the overall size of the regRTI refutation is 
0(n 7 ). This completes the proof of Theorem [2j □ 



4 Greedy, unit-propagating DPLL with clause learn- 
ing 

This section discusses how the refutations in Theorems [1] and [2] can be mod- 
ified so as to ensure that the refutations are greedy and unit-propagating. 

Definition Let R be a tree-like regular w-resolution refutation with input 
lemmas. Let T(C) be the set of clauses of V plus every clause D <r C in R 
that has been derived by an input subproof and thus is available as a learned 
clause to aid in the derivation of C. 

The refutation R is greedy and unit-propagating provided that, for each 
clause C of R, if there is an input derivation from T(C) of some clause 
C C C + which does not resolve on any literal in C + , then C is derived in R 
by such a derivation. 

Note that, as proved in [3], the condition that there is a input deriva- 
tion from r(C) of some C C C + which does not resolve on C + literals 
is equivalent to the condition that if all literals of C + are set false then 
unit propagation yields a contradiction from V{C). (In [3], these are called 
"trivial" proofs.) This justifies the terminology "unit-propagating". 

The definition of "greedy and unit-propagating" is actually a bit more re- 
strictive than necessary, since DPLL algorithms may actually learn multiple 
clauses at once, and this can mean that C is not derived from a single input 
proof but rather from a combination of several input proofs as described in 
the proof of Theorem 5.1 in [12J. 

Theorem 5 The guarded graph tautology formulas GGT n have greedy, unit- 
propagating, polynomial size, tree-like, regular w-resolution refutations with 
input lemmas. 

Proof We indicate how to modify the proofs of Theorems [T] and [2j We 
again build tree-like LR partial refutations satisfying the same properties 
a.-e. as before, except now w-resolution inferences are permitted. Instead of 
being formed in distinct stages Rq , R% , R2 , ■ ■ ■ , the w-resolution refutation R 
is constructed by one continuing process. This construction incorporates all 
of transformations (i)-(iv) and also incorporates the construction of Theorem 
3.3 of p]. 
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At each point in the construction, we will be scanning the so-far con- 
structed partial w-resolution refutation R in preorder, namely in depth- first, 
left-to-right order. That is to say, the construction recursively processes a 
node in the proof tree, then its left subtree, and then its right subtree. Dur- 
ing most steps of the preorder scan, the partial refutation R is modified by 
changing the part that comes subsequently in the preorder, but the con- 
struction may also add and remove literals from clauses below the current 
clause C. When the preorder scan reaches a clause C that has an input 
derivation R' from T(C) of some C'CC that does not resolve on C + , then 
some such R' is inserted into R at that point. When the preorder scan 
reaches an unfinished leaf C = Co, then a (possibly exponentially large) 
derivation P* is added as its derivation. The construction continues pro- 
cessing R by scanning P* in preorder, with the end result that either (1) P* 
is successfully processed and reduced to only polynomial size or (2) the 
preorder scan of P* reaches a transitivity clause Tjj^ of the type that trig- 
gered case (iv) of Theorem [TJ In the latter case, the preorder scan backs up 
to the root clause Co of P* , replaces P* with the derivation S constructed 
in case (iv) of Theorem HJ and restarts the preorder scan at clause Co- 

We describe the actions of the preorder scan in more detail. Initially, 
R is the "empty" derivation, with the empty clause as its only (unfinished) 
clause. A clause C encountered during the preorder scan of R is handled by 
one of the following. 

(«') Suppose that some C C C + can be derived by an input derivation 
from r(C) that does not resolve on any literals of C + . Fix any 
such C C C + , and replace the subderivation in R of the clause C 
with such a derivation of C from T(C). Any extra literals in C \C 
are in C + and are propagated down until reaching a clause where they 
already appear, or occur as a phantom literal. There may also be lit- 
erals in C \C: these literals are removed as necessary from clauses 
below C in R to maintain the property of R containing correct w- 
resolution inferences. Note that this can convert resolution inferences 
into w-resolution inferences. The clause C is now a learned clause. 

Note that this case includes transitivity clauses C = C = I* j fc that 
satisfy the conditions of cases (i)-(iii) of Theorem Q] 

(ii ') If case (i') does not apply, and C is not a leaf node, then R is unchanged 
at this point and the depth-first traversal proceeds to the next clause. 

(Hi') If C is an unfinished clause of the form (V^)> let P n be as before. 
Recall that no literal in C + is resolved on in P n . Unwind the proof P n 
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into a tree-like regular refutation P* that is possibly exponentially big, 
and attach P* to R as a proof of C. Mark the position of C by setting 
Co = C in case it is necessary to later backtrack to C. Then continue 
the preorder scan by traversing into P*. 

(iv') Otherwise, C is a leaf clause of the form Tij^ and since case (i') does 
not apply, one of life's guard literals x, namely x — oCf § or x — 
Xy g . is in C + . If C is not inside the most recently added P* or if 
x € Cq , then replace Tijfc with Tjj^ V x, and propagate the literal X 
downward in the refutation until reaching a clause where it appears as 
a literal or a phantom literal. Otherwise, the preorder scan backtracks 
to the root clause Co of P* , and replaces P* with the partial resolution 
refutation S formed in case (iv) of Theorem [TJ 

It is clear that this process eventually halts with a valid greedy, unit- 
propagating, tree-like w-resolution refutation. We claim that it also yields 
a polynomial size refutation. Consider what happens when a derivation P* 
is inserted. If case (iv') is triggered, then the proof S is inserted in place 
of P*, so the size of P* does not matter. If case (iv') is not triggered, then, 
as in the proof of Theorem 3.3 of |12| . the preorder scan of P* modifies 
(the possibly exponentially large) P* to have polynomial size. Indeed, as 
argued in [TSj , any clause C in P* will occur at most dc times in the modified 
version of P* where dc is the depth of the derivation of C in the original P n . 
This is because C will have been learned by an input derivation once it has 
appeared no more than dc times in the modified derivation P*. This is 
proved by induction on dc- 

Consider the situation where S has just been inserted in place of P* in 
case (iii'). The transitivity clause Tjj^ is not yet learned at this point, since 
otherwise case (i') would have applied. We claim, however, that 2**-* is 
learned as S is traversed. To prove this, since 1}^ fc is manifestly derived by 
an input derivation and since its guard literals x rjS and x r ,s do not appear 
in Cq, it is enough to show that the clause lijfc is reached in the preorder 
traversal scan of S. This, however, is an immediate consequence of the fact 
that Tij : k was reached in the preorder scan of P* and triggered case (iv'), 
since if case (i') applies to or to any clause below Tj in the preorder 
scan of S, then it certainly also applies Tij^ or some clause below T^j^ in 
the preorder scan of P*. 

The size of the final refutation R is bounded the same way as in the 
proof of Theorem [21 and this completes the proof of Theorem [5j □ 
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Theorem 6 There are DPLL search procedures with clause learning which 
are greedy, unit-propagating, but do not use restarts, that refute the GGT n 
clauses in polynomial time. 

We give a sketch of the proof. The construction for the proof of Theo- 
rem requires only that the clauses T^j^ are learned whenever possible, and 
does not depend on whether any other clauses are learned. This means that 
the following algorithm for DPLL search with clause learning will always 
succeed in finding a refutation of the GGT n clauses: At each point, there is 
a partial assignment r. The search algorithm must do one of the following: 

(1) If unit propagation yields a contradiction, then learn a clause Tijk if 

possible, and backtrack. 

(2) Otherwise, if there are any literals in the transitive closure of the bi- 

partite partial order associated with r which are not assigned a value, 
branch on one of these literals to set its value. (One of the true or 
false assignments yields an immediate conflict, and may allow learning 
a clause Tij^-) 

(3) Otherwise, determine whether there is a clause 2* j fc which is used in the 

proof P n whose guard literals are resolved on in P n . (See Lemma SI) 
If not, do a DPLL traversal of P n , eventually backtracking from the 
assignment r. 

(4) Otherwise, some clause Tijk blocks P n from being traversed in polyno- 

mial time. Branch on its variables in the order given in the proof of 
Theorem [TJ From this, learn the clause lij fc- 

5 Guarded, xor-ified, pebbling tautologies 

This section gives polynomial size regRTI refutations for the guarded peb- 
bling tautologies which Urquhart |22j proved require exponential size regular 
resolution proofs. 

Definition A pointed dag G = (V, E) is a directed acyclic graph with a 
single sink t such that every vertex in G has indegree either or 2. The 
pebbling formula Peb(G) for a pointed dag G is the unsatisfiable formula in 
the variables x v for v € V consisting of the following clauses: 

(a) x s , for every source s 6 V, 
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(/?) x u V x v V x w , for every vertex to with two (immediate) predecessors u 
and v, 

(7) Xf, for t the sink vertex. 

We next define Urquhart's "xor-ification" of a pebbling tautology clause. 
Xor-ification, for two variables, is due originally to Alekhnovich and Razborov, 
see Ben-Sasson [5], and is similar to the "orification" used by [6]. The intu- 
ition for xor-ification is that each variable x u is replaced by a set of clauses 
which express the /c-fold exclusive or x Mj i © • • • © x u ^. 

Definition Let k > 0, and variable of the Peb(G) principle. Let 

x^ be x u , and x" 1 be its complement x u . Define xj;® to be the set of clauses 
of the form 

<\i V < 2 V • • • V x* k (4) 

where an even number of the values ij equal —1 (and the rest equal 1). 
Dually, define xjj® to be the set of clauses of the form with an odd 
number of the ij equal to —1. Note there are 2 fc ~ 1 clauses in each of x^® 
and x^®. If C is a clause C = Z\ V • • • V z#, each Zi of the form x u or x u , 
then C fc ® is the set of clauses of the form 

Ci V C 2 V • • • V Ci, 

where each Ci € . There are 

2 (fc-i)< 

many clauses in C ®. 

Definition The xor-ified pebbling formula Peb fc ®(G) is the set of clauses 
C fc ® for C 6 Peb(G). 

Definition Let G be a pointed graph with n vertices and k = k(n) > 0. 
Let p be a function with domain the set of clauses of Peb fc ®(G) and with 
range the set of variables x u ^ of Peb fc ®(G), such that, for all C, the variable 
p(C) is not used in C. The guarded xor-ified pebbling formula GPeb fc ®(Cr) 
is the set of clauses of the forms 

C V p(C) and C V pJC) 

for C € Peb fe ®(G). 

Note that, as in the case of GGT n , the GPeb fc ®(G) clauses depend on the 
choice of p; again this is suppressed in the notation. For a dag G with n 
vertices, the formula GPeb fc ®(G) consists of 0(2 3k n) clauses. 
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Our definitions of Peb fc ®(G) and GPeb fc ®(G) differ somewhat from Urquhart's, 
but these differences are inessential and make no difference to asymptotic 
proof sizes. 

Of course, the Peb fc ®(G) clauses are readily derivable from the GPeb fc ®(G) 
clauses by resolving on the guard literals as given by p. There are simple 
polynomial size regular resolution refutations of the Peb fc ®(G) principles; 
hence there are polynomial size, but not regular, resolution refutations of the 
GPeb fc ®(G) principles. Indeed, Urquhart [22] proved that there are pointed 
graphs G with n vertices and values k = k(n) = O(loglogn), and func- 
tions p, such that regular resolution refutations of the GPeb fc ®(G) clauses 
require size 2 n( - n /^ n ^ lo s lo g™)). 

We shall show that the GPeb fc ®(G) principles have polynomial size proofs 
in regRTI and in pool resolution: 

Theorem 7 The guarded xor-ified pebbling formulas GPeb fc ®(G) have poly- 
nomial size regRTI refutations, and thus polynomial size pool refutations. 

We conjecture that, analogously to Theorem (U the GPeb fe ®(G) principles 
can be shown unsatisfiable by polynomial size, greedy, unit propagating 
DPLL clause learning; however, we have not attempted to prove this. 

We make some simple observations about working with xor-ified clauses 
before proving Theorem [3 

Lemma 8 Let u be a vertex in G. There is a tree-like regular refutation of 
the clauses in x^® and x^® with 2 k — 1 resolution inferences. Its resolution 
variables are the variables x u %. 

Proof This is immediate by inspection: the refutation consists of resolving 
on the literals x Uy i successively for i = 1,2,... ,k, giving a proof of height k. 
The leaf clauses of the proof are the members of x^® and x^®. □ 

The refutation of Lemma [8] can be viewed as being the "^©-translation" of 
the proof 

Xy, Xu 
I 

The next lemma describes a similar "^©-translation" of a proof 

C, x u D , x u 
C~D 
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Lemma 9 Let u be a vertex in G, and let C and D be clauses which do 
not contain either x u andx u . Then each clause of (CV D) k ® has a tree-like 
regular derivation from the clauses in (C V x u ) k ® and (D V x u ) k ® in which 
the variables used as resolution variables are exactly the variables x u ^. This 
derivation has 2 k — 1 resolution inferences, and 2 k leaf clauses. 

Proof Fix a clause E from (C V D) k ®\ we must describe its derivation from 
clauses in (C V x u ) k ® and (D \/x u ) k ®. Let Eq be the subclause of E which 
is from C k ®, and let Ed the subclause of E which is from D k ®. If C and D 
have non-empty intersection, Eq and Ed are not disjoint; however, in any 
event, E = E C U E D . 

Form the refutation from Lemma Then add Eq to every leaf clause 
from x®, add Ed to every leaf clause from x k ®, and add E to every non-leaf 
clause. This gives the desired derivation of E. □. 

Lemma [9] lets us further generalize the construction of fc©-translations of 
proofs. As a typical example, the next lemma gives the fc©-translation of 
the derivation 

X w 

Lemma 10 Let w be a vertex of G, and u and v its predecessors. Then, 
each clause in x*® has a dag-like resolution derivation P from the clauses in 
x k ® , x k ® , and (x u Vi„ V x w ) k ® . This derivation contains < 2 2k resolution 
inferences and resolves on the literals x u ^ and x v ^. In addition, the paths 
in P that lead to clauses in x k ® resolve on exactly the literals x v ^. 

Lemma [TU1 follows by applying Lemma [9] twice. □ 

It is important to note that the left-to-right order of the leaves of the 
derivation of Lemma [10] can be altered by changing the left-right order of 
hypotheses of resolution inferences. In particular, given any leaf clause D of 
a refutation P, we can order the hypotheses of the resolution inferences so 
that D is the leftmost leaf clause. This will be useful when D needs to be 
learned. 

Definition G \w is the induced subgraph of G with sink w and containing 
those vertices from which the vertex w is reachable. G[w] is the graph 
resulting from G by making the vertex w a leaf by removing its incoming 
edges, and then removing those vertices from which the sink vertex of G is 
no longer reachable. 
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The vertex u is an ancestor of w if u ^ w and u G Grfiu. We call u 
and v independent ancestors of w; provided u, v, and u; are distinct and 
u G (Gr fio) [i/] and w G (G |~u>) [n] . In this case, we write for = 

Gr[u][it]. Sometimes v, and possibly also u, may be missing or undefined; in 
these cases, G[u, v] means just G if both u and v are undefined, and means 
G[u] if u is present but v is undefined. 

Note that it is possible for u and v to be independent ancestors of w, 
and also have u an ancestor of v or vice-versa. 

The polynomial size regular resolution refutations of the Peb fc ®(G) prin- 
ciples also apply to subgraphs such as G\w, (G\w)[u] and (G\w)[u, v]. We 
can use these refutations to prove the following lemma. We write Peb^,®(G) 
to denote the ^©-translations of Peb(G) clauses of type (a) and (/?), omit- 
ting the clauses of type (7), and similarly for GPeb^®(G). 

Lemma 11 (i) Let w be a vertex of G. Then each clause in x^® has a regu- 
lar resolution derivation from the clauses Peb^(G\w). The derivation 
uses only resolution variables of the form x a ^ with a G (G\w) \ {w}. 

(ii) Let u and w be distinct vertices of G such that u is an ancestor of w. 

Then each clause in (x u V x w ) k ® has a regular resolution derivation 
from the clauses Veh^{{G\w)[u\) . The derivation uses only resolution 
variables of the form x a ^ for a G \ {u,w}. 

(iii) Let u and v be independent ancestors of w. Then each clause of (x u V 

x v \/x w ) k ® has a regular resolution derivation from the clauses Peb^((G|"ui)[u, v\). 

The derivation uses only resolution variables of the form x a ^ for a G 

({G\w)[u,v])\{u,v,w}. 
Ln all three cases, the derivation is dag-like and has size 0(2 3fc n) and height 
0(kn). 

Proof We describe regular refutations from Peb(G). Their ^©-translations 
will give the regular derivations of Lemma [TT1 

For (i), there is an obvious regular dag-like, size 0(n), derivation P 
of x w from the clauses (a) and (/3) of the (non-xorified) principle Peb(G); 
the derivation proceeds by resolving on literals Xu 111. £1 depth-first traversal 
of G. Forming the ^©-translation P k ® of P forms the desired derivation for 
part (i) of any given clause of x^®. Since P has size 0(n) and clauses in P 
have at most three literals, P fc ® has size 0(2 3fc n) and height O(kn). The 
limitation on which variables can be used as resolution variables follows by 
inspection; it is also a consequence of the fact that the refutation is regular. 

The proofs of (ii) and (iii) are similar. □ 
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Proof (of Theorem [71) We again construct a finite sequence of "LR partial 
refutations", denoted Ro, R\, R2, ■ ■ ■■ This terminates after finitely many 
steps with the desired refutation R. Each LR partial refutation Ri will 
be a correct regRTI proof (and thus a correct pool resolution refutation); 
furthermore, it will satisfy the following conditions: 

a. Ri is a tree of nodes labeled with clauses. The root is labeled with the 

empty clause. Each non-leaf node in Ri has a left child and a right 
child, and the clauses labeling these nodes form a valid resolution 
inference. 

b. For each clause C in Ri, the clause C + is defined as before as 

C + := {£ : The literal I occurs in some clause on the branch 

from the root node of Ri up to and including C}. 

c. Each leaf of Ri is either "finished" or "unfinished". Each finished node 

leaf L is labeled with either a clause from GPeb fc ®(G) or with a clause 
that was derived by an input subderivation of Ri to the left of L in 
postorder. The input subderivation may not contain any unfinished 
leaves. 

d. Each unfinished leaf is labeled with a clause C G E k ® for a clause E such 

that one of the following three possibilities I. -III. holds. I. E is of the 
form x w , and C+ contains no literal x a ,i for vertex a £ (G\w) \ {w}. 
II. E is of the form x u V x w with u an ancestor of w, and C+ contains 
no literal x a ^ for vertex a £ (Gfu;)[ti] \ {u, w}. Or, III. E is of the 
form x u V x v V x w with u and v independent ancestors of w, and C + 
contains no literal x a ,i for any vertex a £ (G\w)[u, v] \ {u,v,w}. 

We introduce a new notational convention to describe (sub)clauses in Ri. 
For w a vertex in G, the notation W or W' denotes a clause in x^®, and 
If or If to denotes a clause in x^®. The notation If or If in no way 
denotes the negation of W or W'\ instead, they are names of clauses, with 
the overline meant only to serve as a reminder of the semantic meaning. 

The initial refutation Rq is formed as follows. Let Q be the fc©-translation 
of the inference 

x t x t 
_L 
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as given by Lemma [H where t is the sink of G. There are 2 k leaf clauses 
of Q: half of them are labeled with a clause T E x^® and the other half 
are labeled with a clause T E a^®- Form i?o from Q by replacing each leaf 
clause T with a derivation 

T,p(T) = JT 1 gT) 
T 

These inferences are regular, since p(T) is not an a^j. The other leaf clauses, 
of the form T, satisfy condition d. and are unfinished clauses in Rq. 

For the inductive step, the LR partial refutation Ri will be transformed 
into Hi-fi. There are several cases to consider; the goal is to replace one 
unfinished leaf by a derivation containing only finished leaves, or to learn one 
more Peb fc ®(G) clause while adding only polynomially many more unfinished 
leaves. 

Consider the leftmost unfinished leaf of Ri. By condition d., its clause C 
will have one of the forms W, or U, W, or U, V, W where U E x^®, V E x^®, 
and W E x^®. By Lemma [TTl there is a dag-like regular refutation P 
of C from the clauses of Peb*f ((G|W)[it,v]). We wish to convert P into a 
derivation from the clauses of GPeb^ ((G\w)[u,v]) and the already learned 
clauses of R{. Consider each leaf clause D of P. Then D is a ^©-translation 
of a clause in PebJo((G\w)[u, v]). As in the proof of Theorem [TJ there are 
four cases to consider: 

(2) If the clause D is already learned as an input lemma in Ri to the left 
of C, then D may be used in P as is. 

For the remaining cases, assume D has not been learned as an input lemma. 

(ii) Let y = p(D). If either y or y is a member of C + , then add that literal 
to D and every clause on the path below D until reaching the first 
clause where it appears. This replaces D with the GPeb k ® p {{G\w)[u, v\) 
clause D V y or D V y . 

(Hi) Suppose cases (i) and (ii) do not apply and that y is not used as a 
resolution variable below D. In this case, replace D by a resolution 
inference deriving D from D\/y and -DVy. This preserves the regularity 
of the derivation. It also makes D a learned clause. 

It is possible that C itself is a Peb^,®(G) clause. If so, then C = D and P is 
the trivial derivation containing only C, and one of cases holds. 
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If all leaf clauses D of P can be treated by cases (z)-(m), then we have 
successfully transformed P into a (still dag-like) derivation P' which satisfies 
regularity and in which leaf clauses are from GPeb fe ® (G) or already learned 
as input lemmas in R{. By Theorem 3.3 of |12| . P' can be converted in a 
regRTI proof P" of the same conclusion as P, preserving the regularity con- 
ditions, and with the size of P' bounded by twice the product of the size of P 
and the height of P. Therefore, the size of P" is 0((2 3k n)(kn)) = 0(k2 3k n 2 ). 
Form .Rj+i by replacing the clause C in Ri with the derivation P" . Ri+\ sat- 
isfies conditions a.-d., and has one fewer unfinished clauses than 

However, if even one leaf clause D of P fails cases then the entire 

subderivation P is abandoned, and we chose some leaf clause D of P to be 
learned such that D does not fall into cases (i)-(m). 

The leaf clause D is the A:0-translation of an (a) or (/3) clause of Peb fc ® (G) 
and thus either has the form for some source e in G or has the form 

A, B, E where A G x k ® , B G x k ®, and E G x k ® for a, b, and e vertices 
in G \w with a and b the two predecessors of e in G. Without loss of gen- 
erality, b is not an ancestor of a in G; otherwise interchange a and b. The 
construction now splits into three cases I., II., and III. depending on the form 
of C. These three cases each split into two subcases depending whether D 
is E or is A, B, E. 

I. In the first case, C is equal to just W. Recall that W is a clause 
in x^®. First suppose D is equal to E. Consider the derivation structure 

x e x e , W 



W 

Note that (0) contains a blend of variables from Peb(G) (non-xorified) and 
from Peb fc ®(G) (xor-ified). However, we can still form its ^©-translation Q: 
the leaf clauses of Q are the 2 k clauses of the form E' G x k ® and of the 
form E , W for e' G x k ® . By choosing the appropriate left-right order for 
the hypotheses in Q, we arrange for D = E to be the leftmost leaf clause 
of Q. Let y = p{D). Then y is not one of the variables x e ^, nor is y or y 
in C + since condition (ii) does not hold for D. Therefore, we can modify Q 
by replacing D = E with 



D,p(D) D,p(D) (6) 



D 

Form -Ri+i from Ri by replacing C with the modified Q. This causes D 
to become learned as an input lemma in Ri+\. The other leaf clauses of Q 
all satisfy condition d. in -Rj+i and thus become unfinished clauses of Ri+%: 
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they are all to the right of D. This adds fewer than 2 k new unfinished clauses 
to Ri+i- 

Second suppose D equals A, B, E. For the moment, assume e ^ w. 
Consider the following derivation structure: 

X5, X e 



Define Q to be the ^©-translation of ([7]) , so Q is a tree-like derivation of the 
clause W from the clauses of the form A ,B ,E', the form A', the form B', 
and the form E ,W, where the clauses A' , A , B' , B , E' , and E range over 
all members of x^®, if®, x^®, x k ®, Xg®, and Xg®, respectively. 

The left-right order of hypotheses in Q is chosen so that the clause D is 
the leftmost leaf clause of Q. The variable y = p(D) is not one of the x a) j's, 
Xb/s, or x ei j's. Since condition (ii) does not hold, neither y nor y is in C + . 
Therefore, we again replace the clause D in Q with the inference ([6]). Form 
Ri+i by replacing the unfinished clause C in Ri with the derivation Q. The 
inference ([6]) does not violate regularity, and causes D to be learned as an 
input lemma in R%+\. 

It is easy to check that the remaining leaf clauses of Q, which have the 
forms A , and B , and E ,W, and A ,B ,E', satisfy condition d. These are 
thus valid as new unfinished clauses in A simple calculation shows 

that there are 2 • 2 3 ( fc ~ 1 ) + 2 2( - k ~ 1 * ) + 2 fc_1 < 2 3k many of these clauses. 

On the other hand, suppose e = w. In this case, use the derivation 
structure 

Xg,X b ,W Xg 

x b ,W Xb_ 

W 

instead of (|7|). Let Q be the ^©-translation of this, and argue again as 
above. We leave the details to the reader. 

This completes the construction of Ri+i in the case where C is just W . 
At least one new clause, D, from Peb fc ®(G) has been learned as an input 
lemma. Since D is learned in the leftmost branch of Q, it is available for use 
as a learned clause for all future unfinished clauses. Fewer than 2 3k many 
new unfinished leaves have been introduced. 

II. Now consider the case where C has the form U, W. First suppose D 
is equal to E £ x®, where e is a source of G. We have e / u since, even if 



X e ,W 



w 



(7) 
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u is a leaf, P does not use the axiom of type (a) for U. Thus e and u are 
independent ancestors of w. Consider the derivation structure 



x e x e ,U,W /M 

Let Q be the ^©-translation of ([8]) arranged so that D is its leftmost leaf 
clause. Now argue as in the earlier cases to form Ri+\. 

Second, suppose D is equal to A, B, E. We suppose that e ^ w, and leave 
the (slightly simpler) case of e = w to the reader^ For the moment, suppose 
u is not equal to a or b. Again, it is not possible for u to equal e, since P 
does not use the clause of type (/?) for u and its predecessors. Consider the 
derivation structure 



?l,X b ,X e ?2,£ 6 



?l,?2,3?e ? 3 ,X e ,W 

u,w 



(9) 



Here the subclauses ?i, ?2, ?3 are unknown and must be determined: in 
fact, one of them will be U and the other two will be empty. The rules for 
determining these are as follows. First, if u is an ancestor of a (and thus not 
equal to a), select 1\ to equal U. Second and otherwise, if u is an ancestor 
of 6, select ?2 to equal U. Otherwise, u and e are independent ancestors of w, 
so select ?3 to equal U. For example, in the second case, the derivation ([9]) 
becomes 



e 



■Cfcj %e U, Xfy 

U,X e X e , W 

u,w 

Now let Q be the ^©-translation of ©, with D the leftmost leaf clause 
of Q. Modify Q by replacing D with ([6]), and then form by replacing 
C with Q. The other leaf clauses of Q have the form A' or U,A', the 
form 5' or 17, 5', the form if, W or J7,lf, W, and the form Z', B~' , E' . By 
examination of which variables are used for resolution in Q, it is clear that 
condition d. is satisfied for these clauses, and thus they are valid unfinished 
clauses in Ri+i. 

This completes the description of Ri+\ in this case. The clause D has 
become newly learned, and < 2 3fc many new unfinished clauses have been 
introduced. 



2 The idea is to modify @ by omitting its final inference and changing the x e 's to W's. 
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Now suppose u = a, so U € xf>. (The case where u = b is similar.) In 
this case, the clause D has the form U,B,E for some B in x^® and some 
some E in Xg®. Let Q be the ^©-translation of 

U , X5, X e Xfo 

U,x e x e ,W 

u,w 

One of the leaf clauses in Q is equal to the clause D to be learned, and Q 
is ordered so that D is its leftmost leaf clause. As before, D is replaced 
in Q with ([6]) and then is formed by replacing C in Ri with Q. The 
clause D has become learned as an input lemma, and < 2 2k many new 
unfinished clauses have been introduced in R4+1. 

III. Third, consider the case where C has the form U,V,W. W.l.o.g., 
v is not an ancestor of u. First suppose D is E € x k ® for e a source in G. We 
again have e is not equal to u, v, or w. Since also e 6 (Gfu;)^, i>] and since 
u and f are independent ancestors of w, there must exist a path from wtoe 
that avoids u and a path from w to m that avoids e and -u, and a path 
from wtoi) that avoids e and n. By looking at the point where these three 
paths first diverge, there is a vertex / that lies on exactly two of these paths. 
Suppose, for instance, that / lies on the two paths from w to u and to e. 
(The other two cases are similar.) Then u and e are independent ancestors 
of /, and / and v are independent ancestors of w. Form the derivation 

structure 

x e U , x e , Xf 

U,x f V,x f ,W (10) 

u,v,w 

Let Q be the /c©-translation of (|10p with D as its leftmost clause and form 
Ri+i as before. 

Second suppose D is A, B, E. We assume that e/w, leaving the slightly 
simpler e = w case to the reader. (See the earlier footnote.) Suppose for 
the moment that u and v are distinct from a and b. As before, they cannot 
equal e. Consider the derivation structure 

?l,X b ,X e ? 2 ,X b 

?l,?2,S e h,X e ,W [ > 

u,v,w 

There are a number of subcases to consider; in each we describe how to set 
?i, ?2, and ?3. By default, 7%, ?2, and ?3 (when not specified otherwise) are 
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to be the empty clause. Some of the subcases are overlapping, and when so, 
either option may be used. For example, it can happen that u is both an 
ancestor of e and a member of (C* fiu) [e] . 

• Suppose u is an ancestor of e and and v 6 (G|"u;)[e]. 

o If u is an ancestor of a, set ?i := U and ?3 := V. 

o Otherwise, u is an ancestor of b, and we set ?2 := U and ?3 := V. 

• Suppose v is an ancestor of e, and u £ {G\w)[e\. This is handled like the 

previous case, interchanging U and V. 

• Suppose neither u nor v is in (G|iu)[e]. 

o If u is an ancestor of a and v an ancestor of 6, set ?i 
? 2 := F. 

o If n is an ancestor of 6 and v an ancestor of a, set ?i 
?2 :=^- 

o If u and t> are independent ancestors of a, set ?i := J7, V 
o If u and t> are independent ancestors of b, set ?2 := U, V 

These four subcases cover all possibilities since u and v are independent 
ancestors of w, and there is no path from w to either u or v which 
avoids e. 

If any of the above subcases hold, form Q as the fc©-translation of (jlip with 
D as its leftmost clause, and form iij+i by exactly the same construction as 
in the earlier cases. 

If, however neither u nor v is an ancestor of e, then (jlip cannot be used. 
Since also e G (G\w)[u, v], and w and v are independent ancestors of w, there 
must be a path from w to n that avoids v and e, a path from w to d that 
avoids u and e, and a path from w to e that avoids u and v. By looking at 
the point where these three paths first diverge, there is a vertex / that lies 
on exactly two of these paths. Suppose again that / lies on the two paths 
from w to u and to e. (The other two cases are similar.) Then, u and e are 
independent ancestors of /, and / and v are independent ancestors of w. 
Form the derivation structure 



:= U and 
:= V and 



U,x e ,Xf 



U,x f 



V,x f ,W 



(12) 



u.v.w 
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Let Q be the ^©-translation of (|12p with D as its leftmost leaf clause. Re- 
place D with ©, and then form Ri+\ by replacing with C with Q. The 
other leaf clauses of Q including those of the form V,F ,W and of the 
form U,E ,F' become valid unfinished clauses that satisfy condition d. The 
clause D has become learned as an input lemma, and Ri+\ has < 2 4fc new 
unfinished clauses. 

We still have to consider the cases where u, v, a and b are not distinct. 
There are four (very similar) cases where only one of u and v is in {a, b}. For 
instance, suppose that u = a and v ^ b. Then form Q as the ^©-translation 
of 

U,x b ,x e ? 1 ,x b 

U,V,W 

where ?i := V if v is an ancestor of b, and ?2 := V if v is not an ancestor 
of 6. Order Q so that D is its leftmost leaf clause, and form R4+1 as before. 

In the case where u = a and v = b, or vice-versa, let Q instead be the 
^©-translation of 

U,V,X e X e ,W 

U,V,W 

and proceed as before. 

This completes case III. and the construction of Ri+i from Ri. 

In cases an unfinished leaf is completely handled without adding 

a new unfinished clause. In cases (iv), a new Peb fc ® (G) clause is learned as an 
input lemma while adding fewer than 2 4fc many new unfinished leaves. There 
are fewer than n2 3 ( fc_1 ) many clauses that can be learned. Therefore the 
process of forming R^s terminates with a refutation R after a finite number 
of steps. The refutation R is a valid regRTI refutation of the GPeb fcffi (G) 
clauses. 

To estimate the size of R, note further that each time an unfinished leaf is 
handled by cases at most k2 3k n 2 many clauses are added. Therefore, 

the refutation R has size 0((2 3fc n)(2 4fc )(/c2 3fc n 2 ) = 0(2 llfe n 3 ). The overall 
size of the GPeb fc ®(Cr) clauses is Q(2 3k n). Thus the size of R is polynomially 
bounded by the size of the GPeb fc ®(G) principle; in fact, it is bounded by a 
degree four polynomial. 

This completes the proof of Theorem [71 □ 
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